Overview
Brought to you by YData
Dataset statistics
| Number of variables | 23 |
|---|---|
| Number of observations | 3661 |
| Missing cells | 6626 |
| Missing cells (%) | 7.9% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 2.2 MiB |
| Average record size in memory | 632.2 B |
Variable types
| Categorical | 10 |
|---|---|
| Text | 3 |
| Numeric | 10 |
area is highly overall correlated with bathroom and 5 other fields | High correlation |
bathroom is highly overall correlated with area and 5 other fields | High correlation |
bedRoom is highly overall correlated with area and 5 other fields | High correlation |
built_up_area is highly overall correlated with area and 4 other fields | High correlation |
carpet_area is highly overall correlated with area and 5 other fields | High correlation |
facing is highly overall correlated with built_up_area | High correlation |
price is highly overall correlated with area and 7 other fields | High correlation |
price_per_sqft is highly overall correlated with price | High correlation |
property_type is highly overall correlated with bedRoom and 2 other fields | High correlation |
servant room is highly overall correlated with bathroom and 1 other fields | High correlation |
super_built_up_area is highly overall correlated with area and 7 other fields | High correlation |
store room is highly imbalanced (55.9%) | Imbalance |
others is highly imbalanced (50.1%) | Imbalance |
facing has 1039 (28.4%) missing values | Missing |
super_built_up_area has 1786 (48.8%) missing values | Missing |
built_up_area has 1987 (54.3%) missing values | Missing |
carpet_area has 1791 (48.9%) missing values | Missing |
area is highly skewed (γ1 = 29.73095613) | Skewed |
built_up_area is highly skewed (γ1 = 40.52309524) | Skewed |
carpet_area is highly skewed (γ1 = 24.32031436) | Skewed |
floorNum has 129 (3.5%) zeros | Zeros |
luxury_score has 459 (12.5%) zeros | Zeros |
Reproduction
| Analysis started | 2025-04-10 13:25:26.870138 |
|---|---|
| Analysis finished | 2025-04-10 13:25:43.795221 |
| Duration | 16.93 seconds |
| Software version | ydata-profiling vv4.16.1 |
| Download configuration | config.json |
Variables
property_type
Categorical
High correlation 
| Distinct | 2 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 247.5 KiB |
| flat | |
|---|---|
| house |
Length
| Max length | 5 |
|---|---|
| Median length | 4 |
| Mean length | 4.230265 |
| Min length | 4 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | flat |
|---|---|
| 2nd row | flat |
| 3rd row | flat |
| 4th row | flat |
| 5th row | flat |
Common Values
| Value | Count | Frequency (%) |
| flat | 2818 | |
| house | 843 | 23.0% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| flat | 2818 | |
| house | 843 | 23.0% |
Most occurring characters
| Value | Count | Frequency (%) |
| f | 2818 | |
| l | 2818 | |
| a | 2818 | |
| t | 2818 | |
| h | 843 | 5.4% |
| o | 843 | 5.4% |
| u | 843 | 5.4% |
| s | 843 | 5.4% |
| e | 843 | 5.4% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 15487 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| f | 2818 | |
| l | 2818 | |
| a | 2818 | |
| t | 2818 | |
| h | 843 | 5.4% |
| o | 843 | 5.4% |
| u | 843 | 5.4% |
| s | 843 | 5.4% |
| e | 843 | 5.4% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 15487 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| f | 2818 | |
| l | 2818 | |
| a | 2818 | |
| t | 2818 | |
| h | 843 | 5.4% |
| o | 843 | 5.4% |
| u | 843 | 5.4% |
| s | 843 | 5.4% |
| e | 843 | 5.4% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 15487 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| f | 2818 | |
| l | 2818 | |
| a | 2818 | |
| t | 2818 | |
| h | 843 | 5.4% |
| o | 843 | 5.4% |
| u | 843 | 5.4% |
| s | 843 | 5.4% |
| e | 843 | 5.4% |
society
Text
| Distinct | 675 |
|---|---|
| Distinct (%) | 18.4% |
| Missing | 1 |
| Missing (%) | < 0.1% |
| Memory size | 292.7 KiB |
Length
| Max length | 49 |
|---|---|
| Median length | 39 |
| Mean length | 16.873497 |
| Min length | 1 |
Unique
| Unique | 308 ? |
|---|---|
| Unique (%) | 8.4% |
Sample
| 1st row | la vida by tata housing |
|---|---|
| 2nd row | umang monsoon breeze |
| 3rd row | ireo the grand arch |
| 4th row | m3m woodshire |
| 5th row | ambience creacions |
| Value | Count | Frequency (%) |
| independent | 486 | 5.0% |
| the | 350 | 3.6% |
| dlf | 219 | 2.3% |
| park | 209 | 2.2% |
| city | 163 | 1.7% |
| global | 153 | 1.6% |
| m3m | 152 | 1.6% |
| emaar | 151 | 1.6% |
| signature | 150 | 1.6% |
| heights | 134 | 1.4% |
| Other values (783) | 7473 |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 6675 | 10.8% |
| 5982 | 9.7% | |
| a | 5840 | 9.5% |
| r | 4156 | 6.7% |
| n | 4140 | 6.7% |
| t | 3703 | 6.0% |
| s | 3465 | 5.6% |
| i | 3331 | 5.4% |
| l | 2929 | 4.7% |
| o | 2746 | 4.4% |
| Other values (32) | 18790 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 61757 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| e | 6675 | 10.8% |
| 5982 | 9.7% | |
| a | 5840 | 9.5% |
| r | 4156 | 6.7% |
| n | 4140 | 6.7% |
| t | 3703 | 6.0% |
| s | 3465 | 5.6% |
| i | 3331 | 5.4% |
| l | 2929 | 4.7% |
| o | 2746 | 4.4% |
| Other values (32) | 18790 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 61757 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| e | 6675 | 10.8% |
| 5982 | 9.7% | |
| a | 5840 | 9.5% |
| r | 4156 | 6.7% |
| n | 4140 | 6.7% |
| t | 3703 | 6.0% |
| s | 3465 | 5.6% |
| i | 3331 | 5.4% |
| l | 2929 | 4.7% |
| o | 2746 | 4.4% |
| Other values (32) | 18790 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 61757 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| e | 6675 | 10.8% |
| 5982 | 9.7% | |
| a | 5840 | 9.5% |
| r | 4156 | 6.7% |
| n | 4140 | 6.7% |
| t | 3703 | 6.0% |
| s | 3465 | 5.6% |
| i | 3331 | 5.4% |
| l | 2929 | 4.7% |
| o | 2746 | 4.4% |
| Other values (32) | 18790 |
sector
Text
| Distinct | 113 |
|---|---|
| Distinct (%) | 3.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 265.7 KiB |
Length
| Max length | 26 |
|---|---|
| Median length | 9 |
| Mean length | 9.3223163 |
| Min length | 7 |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | sector 113 |
|---|---|
| 2nd row | sector 78 |
| 3rd row | sector 58 |
| 4th row | sector 107 |
| 5th row | sector 22 |
| Value | Count | Frequency (%) |
| sector | 3436 | |
| road | 178 | 2.4% |
| sohna | 166 | 2.3% |
| 85 | 108 | 1.5% |
| 102 | 107 | 1.5% |
| 92 | 100 | 1.4% |
| 69 | 93 | 1.3% |
| 90 | 89 | 1.2% |
| 65 | 87 | 1.2% |
| 81 | 87 | 1.2% |
| Other values (106) | 2899 |
Most occurring characters
| Value | Count | Frequency (%) |
| o | 3791 | |
| 3689 | ||
| s | 3681 | |
| r | 3681 | |
| e | 3526 | |
| c | 3487 | |
| t | 3447 | |
| 1 | 1071 | 3.1% |
| 0 | 802 | 2.3% |
| 8 | 778 | 2.3% |
| Other values (21) | 6176 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 34129 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| o | 3791 | |
| 3689 | ||
| s | 3681 | |
| r | 3681 | |
| e | 3526 | |
| c | 3487 | |
| t | 3447 | |
| 1 | 1071 | 3.1% |
| 0 | 802 | 2.3% |
| 8 | 778 | 2.3% |
| Other values (21) | 6176 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 34129 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| o | 3791 | |
| 3689 | ||
| s | 3681 | |
| r | 3681 | |
| e | 3526 | |
| c | 3487 | |
| t | 3447 | |
| 1 | 1071 | 3.1% |
| 0 | 802 | 2.3% |
| 8 | 778 | 2.3% |
| Other values (21) | 6176 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 34129 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| o | 3791 | |
| 3689 | ||
| s | 3681 | |
| r | 3681 | |
| e | 3526 | |
| c | 3487 | |
| t | 3447 | |
| 1 | 1071 | 3.1% |
| 0 | 802 | 2.3% |
| 8 | 778 | 2.3% |
| Other values (21) | 6176 |
price
Real number (ℝ)
High correlation 
| Distinct | 473 |
|---|---|
| Distinct (%) | 12.9% |
| Missing | 1 |
| Missing (%) | < 0.1% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2.5336639 |
| Minimum | 0.07 |
|---|---|
| Maximum | 31.5 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 57.2 KiB |
Quantile statistics
| Minimum | 0.07 |
|---|---|
| 5-th percentile | 0.37 |
| Q1 | 0.95 |
| median | 1.52 |
| Q3 | 2.75 |
| 95-th percentile | 8.5 |
| Maximum | 31.5 |
| Range | 31.43 |
| Interquartile range (IQR) | 1.8 |
Descriptive statistics
| Standard deviation | 2.9806235 |
|---|---|
| Coefficient of variation (CV) | 1.1764084 |
| Kurtosis | 14.933373 |
| Mean | 2.5336639 |
| Median Absolute Deviation (MAD) | 0.72 |
| Skewness | 3.2791705 |
| Sum | 9273.21 |
| Variance | 8.8841164 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1.25 | 80 | 2.2% |
| 1.2 | 64 | 1.7% |
| 1.5 | 64 | 1.7% |
| 0.9 | 63 | 1.7% |
| 1.1 | 62 | 1.7% |
| 1.4 | 60 | 1.6% |
| 1.3 | 57 | 1.6% |
| 0.95 | 52 | 1.4% |
| 2 | 52 | 1.4% |
| 1.6 | 48 | 1.3% |
| Other values (463) | 3058 |
| Value | Count | Frequency (%) |
| 0.07 | 1 | < 0.1% |
| 0.16 | 1 | < 0.1% |
| 0.17 | 1 | < 0.1% |
| 0.19 | 1 | < 0.1% |
| 0.2 | 8 | |
| 0.21 | 6 | |
| 0.22 | 8 | |
| 0.23 | 1 | < 0.1% |
| 0.24 | 6 | |
| 0.25 | 11 |
| Value | Count | Frequency (%) |
| 31.5 | 1 | < 0.1% |
| 27.5 | 1 | < 0.1% |
| 26 | 2 | |
| 25 | 1 | < 0.1% |
| 24 | 1 | < 0.1% |
| 23 | 1 | < 0.1% |
| 22 | 1 | < 0.1% |
| 20 | 3 | |
| 19.5 | 2 | |
| 19 | 3 |
price_per_sqft
Real number (ℝ)
High correlation 
| Distinct | 2651 |
|---|---|
| Distinct (%) | 72.4% |
| Missing | 1 |
| Missing (%) | < 0.1% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 13892.668 |
| Minimum | 4 |
|---|---|
| Maximum | 600000 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 57.2 KiB |
Quantile statistics
| Minimum | 4 |
|---|---|
| 5-th percentile | 4715.95 |
| Q1 | 6817.25 |
| median | 9020 |
| Q3 | 13880.5 |
| 95-th percentile | 33333 |
| Maximum | 600000 |
| Range | 599996 |
| Interquartile range (IQR) | 7063.25 |
Descriptive statistics
| Standard deviation | 23210.067 |
|---|---|
| Coefficient of variation (CV) | 1.6706702 |
| Kurtosis | 186.92801 |
| Mean | 13892.668 |
| Median Absolute Deviation (MAD) | 2794 |
| Skewness | 11.43719 |
| Sum | 50847166 |
| Variance | 5.3870722 × 108 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 10000 | 27 | 0.7% |
| 8000 | 19 | 0.5% |
| 5000 | 17 | 0.5% |
| 12500 | 14 | 0.4% |
| 22222 | 13 | 0.4% |
| 6666 | 13 | 0.4% |
| 11111 | 13 | 0.4% |
| 8333 | 12 | 0.3% |
| 7500 | 12 | 0.3% |
| 33333 | 11 | 0.3% |
| Other values (2641) | 3509 |
| Value | Count | Frequency (%) |
| 4 | 1 | |
| 5 | 1 | |
| 7 | 1 | |
| 9 | 1 | |
| 53 | 1 | |
| 57 | 1 | |
| 58 | 2 | |
| 60 | 1 | |
| 61 | 1 | |
| 79 | 1 |
| Value | Count | Frequency (%) |
| 600000 | 1 | |
| 400000 | 1 | |
| 315789 | 1 | |
| 308333 | 1 | |
| 290948 | 1 | |
| 283333 | 1 | |
| 266666 | 1 | |
| 261194 | 1 | |
| 245398 | 1 | |
| 241666 | 1 |
area
Real number (ℝ)
High correlation  Skewed 
| Distinct | 1312 |
|---|---|
| Distinct (%) | 35.8% |
| Missing | 1 |
| Missing (%) | < 0.1% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2888.3311 |
| Minimum | 50 |
|---|---|
| Maximum | 875000 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 57.2 KiB |
Quantile statistics
| Minimum | 50 |
|---|---|
| 5-th percentile | 518.85 |
| Q1 | 1232.25 |
| median | 1733 |
| Q3 | 2300 |
| 95-th percentile | 4246.2 |
| Maximum | 875000 |
| Range | 874950 |
| Interquartile range (IQR) | 1067.75 |
Descriptive statistics
| Standard deviation | 23167.506 |
|---|---|
| Coefficient of variation (CV) | 8.0210699 |
| Kurtosis | 942.02903 |
| Mean | 2888.3311 |
| Median Absolute Deviation (MAD) | 533 |
| Skewness | 29.730956 |
| Sum | 10571292 |
| Variance | 5.3673333 × 108 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1650 | 54 | 1.5% |
| 1350 | 48 | 1.3% |
| 1800 | 47 | 1.3% |
| 3240 | 43 | 1.2% |
| 1950 | 43 | 1.2% |
| 2700 | 39 | 1.1% |
| 900 | 38 | 1.0% |
| 2000 | 33 | 0.9% |
| 2250 | 25 | 0.7% |
| 2400 | 23 | 0.6% |
| Other values (1302) | 3267 |
| Value | Count | Frequency (%) |
| 50 | 4 | |
| 55 | 1 | < 0.1% |
| 56 | 1 | < 0.1% |
| 57 | 1 | < 0.1% |
| 60 | 2 | |
| 61 | 1 | < 0.1% |
| 67 | 2 | |
| 70 | 1 | < 0.1% |
| 72 | 1 | < 0.1% |
| 76 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 875000 | 1 | |
| 642857 | 1 | |
| 620000 | 1 | |
| 566667 | 1 | |
| 215517 | 1 | |
| 98978 | 1 | |
| 82781 | 1 | |
| 65517 | 2 | |
| 65261 | 1 | |
| 58228 | 1 |
areaWithType
Text
| Distinct | 2350 |
|---|---|
| Distinct (%) | 64.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 426.5 KiB |
Length
| Max length | 124 |
|---|---|
| Median length | 119 |
| Mean length | 54.294455 |
| Min length | 12 |
Unique
| Unique | 1847 ? |
|---|---|
| Unique (%) | 50.5% |
Sample
| 1st row | Super Built up area 2691(250 sq.m.)Built Up area: 2460 sq.ft. (228.54 sq.m.)Carpet area: 2100 sq.ft. (195.1 sq.m.) |
|---|---|
| 2nd row | Super Built up area 1854(172.24 sq.m.)Built Up area: 1668 sq.ft. (154.96 sq.m.)Carpet area: 1501 sq.ft. (139.45 sq.m.) |
| 3rd row | Carpet area: 2864 (266.07 sq.m.) |
| 4th row | Super Built up area 2361(219.34 sq.m.) |
| 5th row | Carpet area: 3000 (278.71 sq.m.) |
| Value | Count | Frequency (%) |
| area | 5552 | |
| sq.m | 3639 | |
| up | 3015 | 10.0% |
| built | 2314 | 7.7% |
| super | 1875 | 6.2% |
| sq.ft | 1751 | 5.8% |
| sq.m.)carpet | 1183 | 3.9% |
| sq.m.)built | 699 | 2.3% |
| carpet | 683 | 2.3% |
| plot | 667 | 2.2% |
| Other values (2842) | 8667 |
Most occurring characters
| Value | Count | Frequency (%) |
| 26384 | 13.3% | |
| . | 20321 | 10.2% |
| a | 13105 | 6.6% |
| r | 9428 | 4.7% |
| e | 9297 | 4.7% |
| 1 | 9195 | 4.6% |
| s | 7536 | 3.8% |
| q | 7405 | 3.7% |
| t | 7303 | 3.7% |
| u | 6765 | 3.4% |
| Other values (25) | 82033 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 198772 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 26384 | 13.3% | |
| . | 20321 | 10.2% |
| a | 13105 | 6.6% |
| r | 9428 | 4.7% |
| e | 9297 | 4.7% |
| 1 | 9195 | 4.6% |
| s | 7536 | 3.8% |
| q | 7405 | 3.7% |
| t | 7303 | 3.7% |
| u | 6765 | 3.4% |
| Other values (25) | 82033 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 198772 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 26384 | 13.3% | |
| . | 20321 | 10.2% |
| a | 13105 | 6.6% |
| r | 9428 | 4.7% |
| e | 9297 | 4.7% |
| 1 | 9195 | 4.6% |
| s | 7536 | 3.8% |
| q | 7405 | 3.7% |
| t | 7303 | 3.7% |
| u | 6765 | 3.4% |
| Other values (25) | 82033 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 198772 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 26384 | 13.3% | |
| . | 20321 | 10.2% |
| a | 13105 | 6.6% |
| r | 9428 | 4.7% |
| e | 9297 | 4.7% |
| 1 | 9195 | 4.6% |
| s | 7536 | 3.8% |
| q | 7405 | 3.7% |
| t | 7303 | 3.7% |
| u | 6765 | 3.4% |
| Other values (25) | 82033 |
bedRoom
Real number (ℝ)
High correlation 
| Distinct | 19 |
|---|---|
| Distinct (%) | 0.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 3.3477192 |
| Minimum | 1 |
|---|---|
| Maximum | 21 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 57.2 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 2 |
| Q1 | 2 |
| median | 3 |
| Q3 | 4 |
| 95-th percentile | 6 |
| Maximum | 21 |
| Range | 20 |
| Interquartile range (IQR) | 2 |
Descriptive statistics
| Standard deviation | 1.8798873 |
|---|---|
| Coefficient of variation (CV) | 0.56154271 |
| Kurtosis | 18.507139 |
| Mean | 3.3477192 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 3.5006609 |
| Sum | 12256 |
| Variance | 3.5339764 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 3 | 1496 | |
| 2 | 941 | |
| 4 | 659 | |
| 5 | 200 | 5.5% |
| 1 | 124 | 3.4% |
| 6 | 73 | 2.0% |
| 9 | 40 | 1.1% |
| 8 | 30 | 0.8% |
| 7 | 28 | 0.8% |
| 12 | 27 | 0.7% |
| Other values (9) | 43 | 1.2% |
| Value | Count | Frequency (%) |
| 1 | 124 | 3.4% |
| 2 | 941 | |
| 3 | 1496 | |
| 4 | 659 | |
| 5 | 200 | 5.5% |
| 6 | 73 | 2.0% |
| 7 | 28 | 0.8% |
| 8 | 30 | 0.8% |
| 9 | 40 | 1.1% |
| 10 | 20 | 0.5% |
| Value | Count | Frequency (%) |
| 21 | 1 | < 0.1% |
| 20 | 1 | < 0.1% |
| 19 | 2 | 0.1% |
| 18 | 2 | 0.1% |
| 16 | 11 | |
| 14 | 1 | < 0.1% |
| 13 | 4 | 0.1% |
| 12 | 27 | |
| 11 | 1 | < 0.1% |
| 10 | 20 |
bathroom
Real number (ℝ)
High correlation 
| Distinct | 19 |
|---|---|
| Distinct (%) | 0.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 3.4127288 |
| Minimum | 1 |
|---|---|
| Maximum | 21 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 57.2 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 2 |
| Q1 | 2 |
| median | 3 |
| Q3 | 4 |
| 95-th percentile | 6 |
| Maximum | 21 |
| Range | 20 |
| Interquartile range (IQR) | 2 |
Descriptive statistics
| Standard deviation | 1.9290253 |
|---|---|
| Coefficient of variation (CV) | 0.56524424 |
| Kurtosis | 17.902203 |
| Mean | 3.4127288 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 3.265409 |
| Sum | 12494 |
| Variance | 3.7211385 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 3 | 1076 | |
| 2 | 1046 | |
| 4 | 818 | |
| 5 | 289 | 7.9% |
| 1 | 155 | 4.2% |
| 6 | 117 | 3.2% |
| 9 | 40 | 1.1% |
| 7 | 38 | 1.0% |
| 8 | 24 | 0.7% |
| 12 | 21 | 0.6% |
| Other values (9) | 37 | 1.0% |
| Value | Count | Frequency (%) |
| 1 | 155 | 4.2% |
| 2 | 1046 | |
| 3 | 1076 | |
| 4 | 818 | |
| 5 | 289 | 7.9% |
| 6 | 117 | 3.2% |
| 7 | 38 | 1.0% |
| 8 | 24 | 0.7% |
| 9 | 40 | 1.1% |
| 10 | 9 | 0.2% |
| Value | Count | Frequency (%) |
| 21 | 1 | < 0.1% |
| 20 | 3 | 0.1% |
| 18 | 4 | 0.1% |
| 17 | 3 | 0.1% |
| 16 | 7 | 0.2% |
| 14 | 2 | 0.1% |
| 13 | 4 | 0.1% |
| 12 | 21 | |
| 11 | 4 | 0.1% |
| 10 | 9 |
Length
| Max length | 2 |
|---|---|
| Median length | 1 |
| Mean length | 1.3171265 |
| Min length | 1 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 3+ |
|---|---|
| 2nd row | 3 |
| 3rd row | 3 |
| 4th row | 3+ |
| 5th row | 3+ |
Common Values
| Value | Count | Frequency (%) |
| 3+ | 1161 | |
| 3 | 1073 | |
| 2 | 882 | |
| 1 | 364 | 9.9% |
| 0 | 181 | 4.9% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 3 | 2234 | |
| 2 | 882 | 24.1% |
| 1 | 364 | 9.9% |
| 0 | 181 | 4.9% |
Most occurring characters
| Value | Count | Frequency (%) |
| 3 | 2234 | |
| + | 1161 | |
| 2 | 882 | 18.3% |
| 1 | 364 | 7.5% |
| 0 | 181 | 3.8% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 4822 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 3 | 2234 | |
| + | 1161 | |
| 2 | 882 | 18.3% |
| 1 | 364 | 7.5% |
| 0 | 181 | 3.8% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 4822 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 3 | 2234 | |
| + | 1161 | |
| 2 | 882 | 18.3% |
| 1 | 364 | 7.5% |
| 0 | 181 | 3.8% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 4822 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 3 | 2234 | |
| + | 1161 | |
| 2 | 882 | 18.3% |
| 1 | 364 | 7.5% |
| 0 | 181 | 3.8% |
floorNum
Real number (ℝ)
Zeros 
| Distinct | 43 |
|---|---|
| Distinct (%) | 1.2% |
| Missing | 19 |
| Missing (%) | 0.5% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 6.8163097 |
| Minimum | 0 |
|---|---|
| Maximum | 51 |
| Zeros | 129 |
| Zeros (%) | 3.5% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 57.2 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 2 |
| median | 5 |
| Q3 | 10 |
| 95-th percentile | 18 |
| Maximum | 51 |
| Range | 51 |
| Interquartile range (IQR) | 8 |
Descriptive statistics
| Standard deviation | 6.0191061 |
|---|---|
| Coefficient of variation (CV) | 0.88304468 |
| Kurtosis | 4.4935142 |
| Mean | 6.8163097 |
| Median Absolute Deviation (MAD) | 3 |
| Skewness | 1.6881491 |
| Sum | 24825 |
| Variance | 36.229638 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 3 | 493 | |
| 2 | 488 | |
| 1 | 349 | 9.5% |
| 4 | 312 | 8.5% |
| 8 | 195 | 5.3% |
| 6 | 183 | 5.0% |
| 10 | 179 | 4.9% |
| 7 | 176 | 4.8% |
| 5 | 169 | 4.6% |
| 9 | 161 | 4.4% |
| Other values (33) | 937 |
| Value | Count | Frequency (%) |
| 0 | 129 | 3.5% |
| 1 | 349 | |
| 2 | 488 | |
| 3 | 493 | |
| 4 | 312 | |
| 5 | 169 | 4.6% |
| 6 | 183 | 5.0% |
| 7 | 176 | 4.8% |
| 8 | 195 | 5.3% |
| 9 | 161 | 4.4% |
| Value | Count | Frequency (%) |
| 51 | 1 | < 0.1% |
| 45 | 1 | < 0.1% |
| 44 | 1 | < 0.1% |
| 43 | 2 | |
| 40 | 1 | < 0.1% |
| 39 | 2 | |
| 38 | 1 | < 0.1% |
| 35 | 2 | |
| 34 | 2 | |
| 33 | 4 |
facing
Categorical
High correlation  Missing 
| Distinct | 8 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 1039 |
| Missing (%) | 28.4% |
| Memory size | 257.0 KiB |
| East | |
|---|---|
| North-East | |
| North | |
| West | |
| South | |
| Other values (3) |
Length
| Max length | 10 |
|---|---|
| Median length | 5 |
| Mean length | 6.8371472 |
| Min length | 4 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | East |
|---|---|
| 2nd row | North-West |
| 3rd row | North-West |
| 4th row | East |
| 5th row | South-East |
Common Values
| Value | Count | Frequency (%) |
| East | 621 | |
| North-East | 621 | |
| North | 386 | 10.5% |
| West | 247 | 6.7% |
| South | 231 | 6.3% |
| North-West | 192 | 5.2% |
| South-East | 171 | 4.7% |
| South-West | 153 | 4.2% |
| (Missing) | 1039 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| east | 621 | |
| north-east | 621 | |
| north | 386 | |
| west | 247 | 9.4% |
| south | 231 | 8.8% |
| north-west | 192 | 7.3% |
| south-east | 171 | 6.5% |
| south-west | 153 | 5.8% |
Most occurring characters
| Value | Count | Frequency (%) |
| t | 3759 | |
| s | 2005 | |
| o | 1754 | |
| h | 1754 | |
| E | 1413 | 7.9% |
| a | 1413 | 7.9% |
| N | 1199 | 6.7% |
| r | 1199 | 6.7% |
| - | 1137 | 6.3% |
| W | 592 | 3.3% |
| Other values (3) | 1702 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 17927 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| t | 3759 | |
| s | 2005 | |
| o | 1754 | |
| h | 1754 | |
| E | 1413 | 7.9% |
| a | 1413 | 7.9% |
| N | 1199 | 6.7% |
| r | 1199 | 6.7% |
| - | 1137 | 6.3% |
| W | 592 | 3.3% |
| Other values (3) | 1702 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 17927 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| t | 3759 | |
| s | 2005 | |
| o | 1754 | |
| h | 1754 | |
| E | 1413 | 7.9% |
| a | 1413 | 7.9% |
| N | 1199 | 6.7% |
| r | 1199 | 6.7% |
| - | 1137 | 6.3% |
| W | 592 | 3.3% |
| Other values (3) | 1702 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 17927 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| t | 3759 | |
| s | 2005 | |
| o | 1754 | |
| h | 1754 | |
| E | 1413 | 7.9% |
| a | 1413 | 7.9% |
| N | 1199 | 6.7% |
| r | 1199 | 6.7% |
| - | 1137 | 6.3% |
| W | 592 | 3.3% |
| Other values (3) | 1702 |
agePossession
Categorical
| Distinct | 6 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 280.2 KiB |
| Relatively New | |
|---|---|
| New Property | |
| Moderately Old | |
| Undefined | |
| Old Property |
Length
| Max length | 18 |
|---|---|
| Median length | 14 |
| Mean length | 13.367113 |
| Min length | 9 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Relatively New |
|---|---|
| 2nd row | Moderately Old |
| 3rd row | Relatively New |
| 4th row | Relatively New |
| 5th row | New Property |
Common Values
| Value | Count | Frequency (%) |
| Relatively New | 1640 | |
| New Property | 590 | 16.1% |
| Moderately Old | 558 | 15.2% |
| Undefined | 313 | 8.5% |
| Old Property | 302 | 8.2% |
| Under Construction | 258 | 7.0% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| new | 2230 | |
| relatively | 1640 | |
| property | 892 | 12.7% |
| old | 860 | 12.3% |
| moderately | 558 | 8.0% |
| undefined | 313 | 4.5% |
| under | 258 | 3.7% |
| construction | 258 | 3.7% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 8402 | |
| l | 4698 | 9.6% |
| t | 3606 | 7.4% |
| 3348 | 6.8% | |
| y | 3090 | 6.3% |
| r | 2858 | 5.8% |
| d | 2302 | 4.7% |
| N | 2230 | 4.6% |
| w | 2230 | 4.6% |
| i | 2211 | 4.5% |
| Other values (15) | 13962 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 48937 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| e | 8402 | |
| l | 4698 | 9.6% |
| t | 3606 | 7.4% |
| 3348 | 6.8% | |
| y | 3090 | 6.3% |
| r | 2858 | 5.8% |
| d | 2302 | 4.7% |
| N | 2230 | 4.6% |
| w | 2230 | 4.6% |
| i | 2211 | 4.5% |
| Other values (15) | 13962 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 48937 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| e | 8402 | |
| l | 4698 | 9.6% |
| t | 3606 | 7.4% |
| 3348 | 6.8% | |
| y | 3090 | 6.3% |
| r | 2858 | 5.8% |
| d | 2302 | 4.7% |
| N | 2230 | 4.6% |
| w | 2230 | 4.6% |
| i | 2211 | 4.5% |
| Other values (15) | 13962 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 48937 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| e | 8402 | |
| l | 4698 | 9.6% |
| t | 3606 | 7.4% |
| 3348 | 6.8% | |
| y | 3090 | 6.3% |
| r | 2858 | 5.8% |
| d | 2302 | 4.7% |
| N | 2230 | 4.6% |
| w | 2230 | 4.6% |
| i | 2211 | 4.5% |
| Other values (15) | 13962 |
super_built_up_area
Real number (ℝ)
High correlation  Missing 
| Distinct | 593 |
|---|---|
| Distinct (%) | 31.6% |
| Missing | 1786 |
| Missing (%) | 48.8% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1925.2376 |
| Minimum | 89 |
|---|---|
| Maximum | 10000 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 57.2 KiB |
Quantile statistics
| Minimum | 89 |
|---|---|
| 5-th percentile | 767 |
| Q1 | 1479.5 |
| median | 1828 |
| Q3 | 2215 |
| 95-th percentile | 3185 |
| Maximum | 10000 |
| Range | 9911 |
| Interquartile range (IQR) | 735.5 |
Descriptive statistics
| Standard deviation | 764.17218 |
|---|---|
| Coefficient of variation (CV) | 0.39692356 |
| Kurtosis | 10.349191 |
| Mean | 1925.2376 |
| Median Absolute Deviation (MAD) | 372 |
| Skewness | 1.8364563 |
| Sum | 3609820.5 |
| Variance | 583959.12 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1950 | 37 | 1.0% |
| 1650 | 37 | 1.0% |
| 2000 | 25 | 0.7% |
| 1578 | 25 | 0.7% |
| 1640 | 22 | 0.6% |
| 2150 | 22 | 0.6% |
| 2408 | 19 | 0.5% |
| 1900 | 19 | 0.5% |
| 1930 | 18 | 0.5% |
| 1350 | 17 | 0.5% |
| Other values (583) | 1634 | |
| (Missing) | 1786 |
| Value | Count | Frequency (%) |
| 89 | 1 | |
| 145 | 1 | |
| 161 | 1 | |
| 215 | 1 | |
| 216 | 1 | |
| 325 | 1 | |
| 340 | 1 | |
| 352 | 1 | |
| 380 | 1 | |
| 406 | 1 |
| Value | Count | Frequency (%) |
| 10000 | 1 | |
| 6926 | 1 | |
| 6000 | 1 | |
| 5800 | 2 | |
| 5514 | 1 | |
| 5350 | 2 | |
| 5200 | 2 | |
| 4890 | 1 | |
| 4857 | 1 | |
| 4848 | 2 |
built_up_area
Real number (ℝ)
High correlation  Missing  Skewed 
| Distinct | 641 |
|---|---|
| Distinct (%) | 38.3% |
| Missing | 1987 |
| Missing (%) | 54.3% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2386.1989 |
| Minimum | 2 |
|---|---|
| Maximum | 737147 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 57.2 KiB |
Quantile statistics
| Minimum | 2 |
|---|---|
| 5-th percentile | 240.65 |
| Q1 | 1110.5 |
| median | 1650 |
| Q3 | 2400 |
| 95-th percentile | 4660.5 |
| Maximum | 737147 |
| Range | 737145 |
| Interquartile range (IQR) | 1289.5 |
Descriptive statistics
| Standard deviation | 18026.927 |
|---|---|
| Coefficient of variation (CV) | 7.5546621 |
| Kurtosis | 1652.6076 |
| Mean | 2386.1989 |
| Median Absolute Deviation (MAD) | 619.5 |
| Skewness | 40.523095 |
| Sum | 3994497 |
| Variance | 3.2497009 × 108 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1800 | 41 | 1.1% |
| 3240 | 37 | 1.0% |
| 1900 | 34 | 0.9% |
| 1350 | 33 | 0.9% |
| 2700 | 32 | 0.9% |
| 900 | 28 | 0.8% |
| 1600 | 26 | 0.7% |
| 2000 | 24 | 0.7% |
| 1300 | 24 | 0.7% |
| 1700 | 23 | 0.6% |
| Other values (631) | 1372 | |
| (Missing) | 1987 |
| Value | Count | Frequency (%) |
| 2 | 1 | < 0.1% |
| 14 | 1 | < 0.1% |
| 30 | 1 | < 0.1% |
| 33 | 1 | < 0.1% |
| 50 | 3 | |
| 53 | 1 | < 0.1% |
| 55 | 1 | < 0.1% |
| 56 | 1 | < 0.1% |
| 57 | 1 | < 0.1% |
| 60 | 5 |
| Value | Count | Frequency (%) |
| 737147 | 1 | < 0.1% |
| 13500 | 1 | < 0.1% |
| 11286 | 1 | < 0.1% |
| 9500 | 1 | < 0.1% |
| 9000 | 7 | |
| 8775 | 1 | < 0.1% |
| 8286 | 1 | < 0.1% |
| 8067.8 | 1 | < 0.1% |
| 8000 | 1 | < 0.1% |
| 7500 | 2 | 0.1% |
carpet_area
Real number (ℝ)
High correlation  Missing  Skewed 
| Distinct | 732 |
|---|---|
| Distinct (%) | 39.1% |
| Missing | 1791 |
| Missing (%) | 48.9% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2531.5396 |
| Minimum | 15 |
|---|---|
| Maximum | 607936 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 57.2 KiB |
Quantile statistics
| Minimum | 15 |
|---|---|
| 5-th percentile | 350 |
| Q1 | 845 |
| median | 1300 |
| Q3 | 1790 |
| 95-th percentile | 2950 |
| Maximum | 607936 |
| Range | 607921 |
| Interquartile range (IQR) | 945 |
Descriptive statistics
| Standard deviation | 22811.918 |
|---|---|
| Coefficient of variation (CV) | 9.0110847 |
| Kurtosis | 603.89238 |
| Mean | 2531.5396 |
| Median Absolute Deviation (MAD) | 470 |
| Skewness | 24.320314 |
| Sum | 4733979 |
| Variance | 5.2038359 × 108 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1400 | 42 | 1.1% |
| 1600 | 35 | 1.0% |
| 1800 | 35 | 1.0% |
| 1200 | 31 | 0.8% |
| 1500 | 29 | 0.8% |
| 1650 | 28 | 0.8% |
| 1350 | 27 | 0.7% |
| 1300 | 23 | 0.6% |
| 1000 | 22 | 0.6% |
| 1450 | 22 | 0.6% |
| Other values (722) | 1576 | |
| (Missing) | 1791 |
| Value | Count | Frequency (%) |
| 15 | 1 | < 0.1% |
| 33 | 1 | < 0.1% |
| 48 | 1 | < 0.1% |
| 50 | 1 | < 0.1% |
| 59 | 1 | < 0.1% |
| 60 | 1 | < 0.1% |
| 66 | 1 | < 0.1% |
| 72 | 1 | < 0.1% |
| 76.44 | 3 | |
| 77.31 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 607936 | 1 | |
| 569243 | 1 | |
| 514396 | 1 | |
| 64529 | 1 | |
| 64412 | 1 | |
| 58141 | 1 | |
| 54917 | 1 | |
| 48811 | 1 | |
| 45966 | 1 | |
| 34401 | 1 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 0 |
| 3rd row | 1 |
| 4th row | 0 |
| 5th row | 0 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 2964 | |
| 1 | 697 | 19.0% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 0 | 2964 | |
| 1 | 697 | 19.0% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 2964 | |
| 1 | 697 | 19.0% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 3661 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 0 | 2964 | |
| 1 | 697 | 19.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 3661 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 0 | 2964 | |
| 1 | 697 | 19.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 3661 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 0 | 2964 | |
| 1 | 697 | 19.0% |
servant room
Categorical
High correlation 
| Distinct | 2 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 236.0 KiB |
| 0 | |
|---|---|
| 1 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 1 |
|---|---|
| 2nd row | 1 |
| 3rd row | 1 |
| 4th row | 1 |
| 5th row | 1 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 2342 | |
| 1 | 1319 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 0 | 2342 | |
| 1 | 1319 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 2342 | |
| 1 | 1319 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 3661 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 0 | 2342 | |
| 1 | 1319 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 3661 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 0 | 2342 | |
| 1 | 1319 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 3661 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 0 | 2342 | |
| 1 | 1319 |
store room
Categorical
Imbalance 
| Distinct | 2 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 236.0 KiB |
| 0 | |
|---|---|
| 1 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 0 |
| 3rd row | 0 |
| 4th row | 1 |
| 5th row | 0 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 3326 | |
| 1 | 335 | 9.2% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 0 | 3326 | |
| 1 | 335 | 9.2% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 3326 | |
| 1 | 335 | 9.2% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 3661 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 0 | 3326 | |
| 1 | 335 | 9.2% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 3661 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 0 | 3326 | |
| 1 | 335 | 9.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 3661 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 0 | 3326 | |
| 1 | 335 | 9.2% |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 1 |
| 3rd row | 1 |
| 4th row | 0 |
| 5th row | 1 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 3011 | |
| 1 | 650 | 17.8% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 0 | 3011 | |
| 1 | 650 | 17.8% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 3011 | |
| 1 | 650 | 17.8% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 3661 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 0 | 3011 | |
| 1 | 650 | 17.8% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 3661 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 0 | 3011 | |
| 1 | 650 | 17.8% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 3661 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 0 | 3011 | |
| 1 | 650 | 17.8% |
others
Categorical
Imbalance 
| Distinct | 2 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 236.0 KiB |
| 0 | |
|---|---|
| 1 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 0 |
| 3rd row | 1 |
| 4th row | 0 |
| 5th row | 0 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 3259 | |
| 1 | 402 | 11.0% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 0 | 3259 | |
| 1 | 402 | 11.0% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 3259 | |
| 1 | 402 | 11.0% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 3661 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 0 | 3259 | |
| 1 | 402 | 11.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 3661 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 0 | 3259 | |
| 1 | 402 | 11.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 3661 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 0 | 3259 | |
| 1 | 402 | 11.0% |
furnishing_type
Categorical
| Distinct | 3 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 236.0 KiB |
| 0 | |
|---|---|
| 2 | |
| 1 | 203 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 0 |
| 3rd row | 2 |
| 4th row | 2 |
| 5th row | 2 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 2403 | |
| 2 | 1055 | |
| 1 | 203 | 5.5% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 0 | 2403 | |
| 2 | 1055 | |
| 1 | 203 | 5.5% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 2403 | |
| 2 | 1055 | |
| 1 | 203 | 5.5% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 3661 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 0 | 2403 | |
| 2 | 1055 | |
| 1 | 203 | 5.5% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 3661 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 0 | 2403 | |
| 2 | 1055 | |
| 1 | 203 | 5.5% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 3661 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 0 | 2403 | |
| 2 | 1055 | |
| 1 | 203 | 5.5% |
luxury_score
Real number (ℝ)
Zeros 
| Distinct | 161 |
|---|---|
| Distinct (%) | 4.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 71.544387 |
| Minimum | 0 |
|---|---|
| Maximum | 174 |
| Zeros | 459 |
| Zeros (%) | 12.5% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 57.2 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 31 |
| median | 59 |
| Q3 | 110 |
| 95-th percentile | 174 |
| Maximum | 174 |
| Range | 174 |
| Interquartile range (IQR) | 79 |
Descriptive statistics
| Standard deviation | 53.062737 |
|---|---|
| Coefficient of variation (CV) | 0.74167576 |
| Kurtosis | -0.87973169 |
| Mean | 71.544387 |
| Median Absolute Deviation (MAD) | 38 |
| Skewness | 0.45928832 |
| Sum | 261924 |
| Variance | 2815.6541 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 459 | 12.5% |
| 49 | 347 | 9.5% |
| 174 | 195 | 5.3% |
| 44 | 60 | 1.6% |
| 165 | 55 | 1.5% |
| 38 | 55 | 1.5% |
| 72 | 52 | 1.4% |
| 60 | 47 | 1.3% |
| 42 | 45 | 1.2% |
| 37 | 45 | 1.2% |
| Other values (151) | 2301 |
| Value | Count | Frequency (%) |
| 0 | 459 | |
| 5 | 6 | 0.2% |
| 6 | 6 | 0.2% |
| 7 | 41 | 1.1% |
| 8 | 30 | 0.8% |
| 9 | 9 | 0.2% |
| 12 | 6 | 0.2% |
| 13 | 10 | 0.3% |
| 14 | 12 | 0.3% |
| 15 | 42 | 1.1% |
| Value | Count | Frequency (%) |
| 174 | 195 | |
| 169 | 1 | < 0.1% |
| 168 | 9 | 0.2% |
| 167 | 21 | 0.6% |
| 166 | 10 | 0.3% |
| 165 | 55 | 1.5% |
| 161 | 3 | 0.1% |
| 160 | 27 | 0.7% |
| 159 | 23 | 0.6% |
| 158 | 34 | 0.9% |
Interactions
Correlations
| agePossession | area | balcony | bathroom | bedRoom | built_up_area | carpet_area | facing | floorNum | furnishing_type | luxury_score | others | pooja room | price | price_per_sqft | property_type | servant room | store room | study room | super_built_up_area | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| agePossession | 1.000 | 0.000 | 0.271 | 0.110 | 0.129 | 0.000 | 0.000 | 0.092 | 0.127 | 0.214 | 0.254 | 0.106 | 0.187 | 0.102 | 0.055 | 0.391 | 0.286 | 0.142 | 0.134 | 0.086 |
| area | 0.000 | 1.000 | 0.011 | 0.687 | 0.624 | 0.835 | 0.801 | 0.022 | 0.116 | 0.043 | 0.259 | 0.042 | 0.037 | 0.744 | 0.207 | 0.028 | 0.015 | 0.039 | 0.018 | 0.948 |
| balcony | 0.271 | 0.011 | 1.000 | 0.224 | 0.174 | 0.000 | 0.026 | 0.016 | 0.079 | 0.178 | 0.224 | 0.080 | 0.194 | 0.136 | 0.033 | 0.212 | 0.439 | 0.144 | 0.180 | 0.306 |
| bathroom | 0.110 | 0.687 | 0.224 | 1.000 | 0.862 | 0.486 | 0.604 | 0.039 | -0.003 | 0.196 | 0.180 | 0.064 | 0.284 | 0.720 | 0.411 | 0.469 | 0.519 | 0.244 | 0.170 | 0.819 |
| bedRoom | 0.129 | 0.624 | 0.174 | 0.862 | 1.000 | 0.397 | 0.574 | 0.029 | -0.100 | 0.167 | 0.058 | 0.073 | 0.288 | 0.681 | 0.417 | 0.592 | 0.316 | 0.224 | 0.148 | 0.800 |
| built_up_area | 0.000 | 0.835 | 0.000 | 0.486 | 0.397 | 1.000 | 0.969 | 1.000 | 0.090 | 0.089 | 0.294 | 0.000 | 0.000 | 0.605 | 0.132 | 0.000 | 0.000 | 0.000 | 0.000 | 0.926 |
| carpet_area | 0.000 | 0.801 | 0.026 | 0.604 | 0.574 | 0.969 | 1.000 | 0.000 | 0.157 | 0.000 | 0.239 | 0.017 | 0.000 | 0.613 | 0.136 | 0.000 | 0.000 | 0.000 | 0.004 | 0.894 |
| facing | 0.092 | 0.022 | 0.016 | 0.039 | 0.029 | 1.000 | 0.000 | 1.000 | 0.000 | 0.049 | 0.065 | 0.000 | 0.032 | 0.021 | 0.000 | 0.093 | 0.035 | 0.037 | 0.000 | 0.000 |
| floorNum | 0.127 | 0.116 | 0.079 | -0.003 | -0.100 | 0.090 | 0.157 | 0.000 | 1.000 | 0.019 | 0.232 | 0.033 | 0.101 | 0.001 | -0.126 | 0.482 | 0.086 | 0.111 | 0.076 | 0.152 |
| furnishing_type | 0.214 | 0.043 | 0.178 | 0.196 | 0.167 | 0.089 | 0.000 | 0.049 | 0.019 | 1.000 | 0.244 | 0.057 | 0.216 | 0.176 | 0.023 | 0.079 | 0.271 | 0.155 | 0.140 | 0.133 |
| luxury_score | 0.254 | 0.259 | 0.224 | 0.180 | 0.058 | 0.294 | 0.239 | 0.065 | 0.232 | 0.244 | 1.000 | 0.175 | 0.187 | 0.215 | 0.054 | 0.330 | 0.346 | 0.229 | 0.180 | 0.222 |
| others | 0.106 | 0.042 | 0.080 | 0.064 | 0.073 | 0.000 | 0.017 | 0.000 | 0.033 | 0.057 | 0.175 | 1.000 | 0.028 | 0.034 | 0.036 | 0.024 | 0.000 | 0.106 | 0.026 | 0.084 |
| pooja room | 0.187 | 0.037 | 0.194 | 0.284 | 0.288 | 0.000 | 0.000 | 0.032 | 0.101 | 0.216 | 0.187 | 0.028 | 1.000 | 0.334 | 0.043 | 0.250 | 0.249 | 0.305 | 0.309 | 0.157 |
| price | 0.102 | 0.744 | 0.136 | 0.720 | 0.681 | 0.605 | 0.613 | 0.021 | 0.001 | 0.176 | 0.215 | 0.034 | 0.334 | 1.000 | 0.744 | 0.543 | 0.369 | 0.303 | 0.244 | 0.772 |
| price_per_sqft | 0.055 | 0.207 | 0.033 | 0.411 | 0.417 | 0.132 | 0.136 | 0.000 | -0.126 | 0.023 | 0.054 | 0.036 | 0.043 | 0.744 | 1.000 | 0.201 | 0.044 | 0.000 | 0.030 | 0.287 |
| property_type | 0.391 | 0.028 | 0.212 | 0.469 | 0.592 | 0.000 | 0.000 | 0.093 | 0.482 | 0.079 | 0.330 | 0.024 | 0.250 | 0.543 | 0.201 | 1.000 | 0.062 | 0.241 | 0.123 | 1.000 |
| servant room | 0.286 | 0.015 | 0.439 | 0.519 | 0.316 | 0.000 | 0.000 | 0.035 | 0.086 | 0.271 | 0.346 | 0.000 | 0.249 | 0.369 | 0.044 | 0.062 | 1.000 | 0.159 | 0.181 | 0.584 |
| store room | 0.142 | 0.039 | 0.144 | 0.244 | 0.224 | 0.000 | 0.000 | 0.037 | 0.111 | 0.155 | 0.229 | 0.106 | 0.305 | 0.303 | 0.000 | 0.241 | 0.159 | 1.000 | 0.226 | 0.046 |
| study room | 0.134 | 0.018 | 0.180 | 0.170 | 0.148 | 0.000 | 0.004 | 0.000 | 0.076 | 0.140 | 0.180 | 0.026 | 0.309 | 0.244 | 0.030 | 0.123 | 0.181 | 0.226 | 1.000 | 0.121 |
| super_built_up_area | 0.086 | 0.948 | 0.306 | 0.819 | 0.800 | 0.926 | 0.894 | 0.000 | 0.152 | 0.133 | 0.222 | 0.084 | 0.157 | 0.772 | 0.287 | 1.000 | 0.584 | 0.046 | 0.121 | 1.000 |
Missing values
Sample
| property_type | society | sector | price | price_per_sqft | area | areaWithType | bedRoom | bathroom | balcony | floorNum | facing | agePossession | super_built_up_area | built_up_area | carpet_area | study room | servant room | store room | pooja room | others | furnishing_type | luxury_score | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | flat | la vida by tata housing | sector 113 | 3.00 | 14285.0 | 2100.0 | Super Built up area 2691(250 sq.m.)Built Up area: 2460 sq.ft. (228.54 sq.m.)Carpet area: 2100 sq.ft. (195.1 sq.m.) | 3 | 3 | 3+ | 3.0 | East | Relatively New | 2691.0 | 2460.0 | 2100.0 | 0 | 1 | 0 | 0 | 0 | 0 | 167 |
| 1 | flat | umang monsoon breeze | sector 78 | 0.88 | 4746.0 | 1854.0 | Super Built up area 1854(172.24 sq.m.)Built Up area: 1668 sq.ft. (154.96 sq.m.)Carpet area: 1501 sq.ft. (139.45 sq.m.) | 3 | 4 | 3 | 8.0 | North-West | Moderately Old | 1854.0 | 1668.0 | 1501.0 | 0 | 1 | 0 | 1 | 0 | 0 | 111 |
| 2 | flat | ireo the grand arch | sector 58 | 4.85 | 16934.0 | 2864.0 | Carpet area: 2864 (266.07 sq.m.) | 4 | 5 | 3 | 23.0 | North-West | Relatively New | NaN | NaN | 2864.0 | 1 | 1 | 0 | 1 | 1 | 2 | 49 |
| 3 | flat | m3m woodshire | sector 107 | 1.49 | 6310.0 | 2361.0 | Super Built up area 2361(219.34 sq.m.) | 3 | 4 | 3+ | 5.0 | East | Relatively New | 2361.0 | NaN | NaN | 0 | 1 | 1 | 0 | 0 | 2 | 174 |
| 4 | flat | ambience creacions | sector 22 | 6.16 | 20533.0 | 3000.0 | Carpet area: 3000 (278.71 sq.m.) | 4 | 5 | 3+ | 10.0 | South-East | New Property | NaN | NaN | 3000.0 | 0 | 1 | 0 | 1 | 0 | 2 | 49 |
| 5 | flat | godrej aria | sector 79 | 1.45 | 9062.0 | 1600.0 | Super Built up area 1495(138.89 sq.m.)Built Up area: 1494 sq.ft. (138.8 sq.m.) | 2 | 2 | 3+ | 9.0 | East | Relatively New | 1495.0 | 1494.0 | NaN | 1 | 0 | 0 | 0 | 0 | 0 | 113 |
| 6 | flat | tulip violet | sector 69 | 1.55 | 9822.0 | 1578.0 | Super Built up area 1578(146.6 sq.m.) | 3 | 3 | 2 | 9.0 | North-East | Relatively New | 1578.0 | NaN | NaN | 0 | 0 | 0 | 1 | 0 | 2 | 165 |
| 7 | flat | smart world gems | sector 89 | 1.20 | 8432.0 | 1423.0 | Carpet area: 1423 (132.2 sq.m.) | 3 | 3 | 2 | 2.0 | North-East | New Property | NaN | NaN | 1423.0 | 0 | 0 | 0 | 0 | 0 | 0 | 44 |
| 8 | flat | kiran residency | sector 56 | 1.55 | 7750.0 | 2000.0 | Super Built up area 2000(185.81 sq.m.) | 3 | 3 | 3 | 5.0 | North | Old Property | 2000.0 | NaN | NaN | 0 | 1 | 0 | 0 | 0 | 2 | 52 |
| 9 | house | emaar mgf marbella | sector 66 | 12.00 | 38095.0 | 3150.0 | Plot area 350(292.64 sq.m.) | 5 | 6 | 3+ | 3.0 | North-East | Relatively New | NaN | 3150.0 | NaN | 0 | 1 | 1 | 0 | 0 | 2 | 153 |
| property_type | society | sector | price | price_per_sqft | area | areaWithType | bedRoom | bathroom | balcony | floorNum | facing | agePossession | super_built_up_area | built_up_area | carpet_area | study room | servant room | store room | pooja room | others | furnishing_type | luxury_score | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 3788 | house | Independent | sector 25 | 13.50 | 37313.0 | 3618.0 | Plot area 402(336.12 sq.m.) | 5 | 6 | 3+ | 4.0 | North-East | Old Property | NaN | 3618.0 | NaN | 0 | 0 | 0 | 0 | 1 | 2 | 79 |
| 3789 | house | greenopolis | sector 89 | 0.70 | 5397.0 | 1297.0 | Built Up area: 1297 (120.5 sq.m.) | 2 | 2 | 2 | 14.0 | North-East | Undefined | NaN | 1297.0 | NaN | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| 3791 | flat | emaar digihomes | sector 62 | 2.10 | 14254.0 | 1473.0 | Super Built up area 1508.26(140.12 sq.m.) | 2 | 2 | 2 | 30.0 | North | New Property | 1508.26 | NaN | NaN | 0 | 1 | 0 | 0 | 0 | 0 | 52 |
| 3792 | flat | dlf new town heights | sector 90 | 1.80 | 6600.0 | 2727.0 | Super Built up area 2727(253.35 sq.m.) | 4 | 4 | 3+ | 23.0 | NaN | Moderately Old | 2727.00 | NaN | NaN | 0 | 0 | 0 | 0 | 0 | 0 | 67 |
| 3793 | house | international city by sobha phase 2 | sector 109 | 8.48 | 23556.0 | 3600.0 | Plot area 400(334.45 sq.m.) | 4 | 6 | 3+ | 3.0 | North-East | New Property | NaN | 3600.0 | NaN | 1 | 1 | 1 | 1 | 0 | 0 | 153 |
| 3794 | flat | signature global park 4 | sector 36 | 1.00 | 9900.0 | 1010.0 | Carpet area: 1010 (93.83 sq.m.) | 3 | 2 | 3 | 2.0 | NaN | New Property | NaN | NaN | 1010.0 | 0 | 0 | 0 | 0 | 0 | 0 | 128 |
| 3795 | house | suncity essel towers | sector 28 | 6.00 | 22222.0 | 2700.0 | Plot area 300(250.84 sq.m.) | 4 | 4 | 3+ | 3.0 | East | Old Property | NaN | 2700.0 | NaN | 1 | 0 | 0 | 0 | 0 | 2 | 89 |
| 3796 | flat | sobha city | sector 108 | 4.10 | 23401.0 | 1752.0 | Super Built up area 2344(217.76 sq.m.)Built Up area: 2343 sq.ft. (217.67 sq.m.)Carpet area: 1752 sq.ft. (162.77 sq.m.) | 4 | 5 | 3 | 25.0 | North-East | New Property | 2344.00 | 2343.0 | 1752.0 | 1 | 1 | 0 | 0 | 0 | 1 | 97 |
| 3797 | flat | vatika emilia floors | sector 83 | 0.60 | 5714.0 | 1050.0 | Super Built up area 1050(97.55 sq.m.)Built Up area: 990 sq.ft. (91.97 sq.m.)Carpet area: 950 sq.ft. (88.26 sq.m.) | 2 | 2 | 2 | 1.0 | South | Relatively New | 1050.00 | 990.0 | 950.0 | 0 | 0 | 0 | 0 | 0 | 2 | 167 |
| 3798 | flat | dlf new town heights | sector 90 | 1.55 | 6556.0 | 2364.0 | Super Built up area 2364(219.62 sq.m.) | 4 | 4 | 3+ | 1.0 | South-East | Relatively New | 2364.00 | NaN | NaN | 0 | 1 | 0 | 1 | 0 | 0 | 81 |